ON RATES OF CONVERGENCE AND BERRY-ESSEEN BOUNDS 
FOR RANDOM SUMS OF CENTERED RANDOM VARIABLES 
WITH FINITE THIRD MOMENTS 



CHRISTIAN DOBLER 

Abstract. We show, how the classical Berry-Esseen theorem for normal approx- 
imation may be used to derive rates of convergence for random sums of centerd, 
real- valued random variables with respect to a certain class of probability metrics, 
including the Kolmogorov and the Wasserstein distances. This technique is ap- 
plied to several examples, including the approximation by a Laplace distribution 
of a geometric sum of centered random variables with finite third moment, where 
a concrete Berry-Esseen bound is derived. This bound reduces to a bound of the 
supposedly optimal order ^ in the i.i.d. case. 



1. Introduction 
Random sums are random variables of the form 



N 



where Xq, Xi, X2, . . . and N are random variables on a common probability space 
(n, P) and where N assumes nonnegative integer values. Random variables of the 
form ([T]) appear frequently in modern probabiliy theory, because many models for 
example from physics, finance, reliability and risk theory naturally lead to the con- 
sideration of such sums. Furthermore, sometimes a model, which looks completely 
different from ([T]) at the outset, may be transformed into a random sum and then 
general theory of such sums may be invoked to study the original model |GK96) . 
There already exists some literature on the asymptotic distributions of such ran- 
dom sums, see e.g. |GK96] for general distributions of the index N and |Kal97] 
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for the special case of geometric sums, where N = Np has the geometric dis- 
tribution on Z n [1,00) with parameter p G (0,1). Recall that this means that 
P{N = k) = p{l — pY~^ for = 1,2, . . . . In |Kal97] . also quantitative theorems, 
giving bounds on the approximation error, are derived but mainly nonnegative sum- 
mands Xj are considered. Contrarily, the theorems from | ,GK96] are very general, 
not even the existence of moments is assumed at the outset, but are only qualitative 
in that they are not giving any error bounds. In |PR11| Stein's method for the expo- 
nential distribution together with a coupling construction, relying on the equilibrium 
transformation from renewal theory, are used to prove error bounds for the conver- 
gence of geometric (as well as more general) random sums of nonnegative (possibly 
dependent) random variables to the exponential distribution, generalizing a classical 
theorem by Renyi. Recently, in |PR12| a suitable version of Stein's method for the 
Laplace distribution and a distributional transformation in the spirit of the theory 
from |PR11| were developed to prove rates of convergence for geometric (as well as 
more general) sums of centered random variables with finite third moments to the 
Laplace distribution. At least in the special case of a geometric sum, the theorems 
from pPRl2j seem to yield optimal rates of convergence in the bounded Lipschitz 
metric, but due to the distributional transformation, which is used for the coupling 
construction, the summands Xj have to fuUfil a certain symmetry condition in order 
that the coupling can be constructed. 

The purpose of the present paper is to show how one can use the classical Berry- 
Esseen theorem for normal approximation along with asymptotic results on the index 
N to obtain rates of convergence for suitably standardized versions of the sum ([T]). 
This is in a way analogous to the transfer theorems from |GK96| . 
The paper is organized as follows. In Section [2] we present the general setup and 
some abstract results, which help bound the distance from (a scaled version of) the 
random variable S* to a suitable limit distribution. Then, in Section [3] we prove rates 
of convergence for several distributions of the index A^, including the geometric, the 
Poisson and the binomial distributions. 



2. Abstract approximation results 

Througout assume that the random variables A^, Xi, X2, . . . are independent, that 
the Xj are centered having finite third moments S,j '■= E\Xj\^ and that < /i : = 
E[N] < 00. Furthermore, we assume < cr| := -EfA"!] = Var(Xj), j > 1, and define 



N 



(2) 
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Then, letting s% := VaT{W\N) it is easy to see that = /i ^ ^j=i '^j ^^'^ that 
E[W\N] = 0. Furthermore, since Var(Vr) = E[VaY{W\N)]+VaT{E[W\N]), it follows 
that 



Var{W) = E[s%] = f^-'J^H = ^) 

n=0 j=l 



oo oo 



(3) = E E = ^) = E ^ • 

j=l n=j j=l 

Now, assume that Zi, Z2, . . . are further random variables on the same probability 
space such that Zj ~ A^(0, aj), j > 1, and that also A^, Zi, Z2, . . . are independent. 
Then, we define the random variable 



N 



(4) Z := Yl • 

i=i 

It is clear that C{Z\N) = N{0,s%) and hence, Var(Z|iV) = s% = VaT{W\N). For 
a given class "H of Borel-measurable functions /i : M — > M consider the distance 



(5) z/) := sup / hd^ — / hdu 

hen Jr Jr 

on the space of probability measures with respect to which every h E is inte- 
grable. For instance, if H = W is the class of 1-Lipschitz functions, then reduces 
to the Wasserstein distance of /i and u. On the other hand, if "H = /C = {l(_oo,z] : 
z G M}, then die is called the Kolmogorov distance. Both of these distances are in fact 
metrics, which yield stronger topologies on their domains than the topology induced 
by weak convergence (see, e.g. |Dud02j ). 

For our approximation theorems we need the following result, which is an instance 
of the classical Berry-Esseen theorem (see, e.g. |Fel71j ). 

Proposition 2.1. Under the above assumptions we have the following hounds: 
(a) dy,{C{W\N),N{Q,sl)) <^Y.Ui^ 



(b) rf^(£(H^|iv),iv(o,4)) < ^E7=ie, 



Here, Cyv o-nd C/c are absolute and finite constants. 

Remark 2.2. The search for the optimal value of the constant C/c, the Berry-Esseen 
constant, is still going on. As far as we know, the best proven bound is Cic < 0.56 
by Shevtsova [ShelO] . 
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Using Stein's method (see [CS05| ). one may obtain Cw < 6, which seems to be far 
from optimaL 



Proof of Proposition \2.1\ Part (b) follows immediately from the classical Berry- Esseen 
theorem (see |Fel71j ) , since the Kolmogorov distance is scale-invariant. Since for 
random variables X and Y (with existing first moments) and o" > we have 

dy^{C{aX),C{(yy)) = adw{C{X),C{Y)) , 
the claim of (a) follows from the Berry-Esseen result for the Wasserstein distance 
(see |CS05] . for example). 

□ 

Now, let h & T-L he a given test function. Then, 



\E[h{W)] - E[h{Z)]\ = \E[E[h{W)\N] - E[h{Z)\N 
< E\E[h{W)\N] -E[h{Z)\N]\ 
<E\dn{C{W\N),C{Z\N))\ 



(6) 



E 



dn{CiW\N),CiZ\N)) 



and hence, by taking the supremum over h eH, we obtain from ([6]) that 



(7) 



dn{C{W),CiZ)) <E dn{C{W\N),C{Z\N)) 



Recalling that C{Z\N) = N{0,s1f) by Proposition 12.11 this leads to the following 
theorem. 

Proposition 2.3. Under our general assumptions, we have the following bounds: 

(a) d^{C{W),C{Z)) < ^E[^j:f=i^j 

(b) d4c{w),c{z)) < ^e[^zU^. 

Corollary 2.4. Assume that, additionally to our general assumptions, the Xj are 
identically distributed with := and C, '■= E\Xi\'^. Then, 

(a) d^{C{W),C{Z))<^ and 

(b) d4C{W),CiZ))<^E[^]. 

Note that the bounds from Proposition 12.31 and from Corollary 12.41 tend to con- 
verge to zero as /i — )■ oo, i.e. as the number of summands is infinitely growing. 
The next step is to find bounds on the distance (i^ (£(Z), £(y)) for some random 
variable Y. In order to motivate the concrete form of Y let us for the moment 
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assume that we are given a whole sequence (A^a;)a:>i of random indices, such that 

Hk '■= E[Nk] — 7- oo as A; — 7- oo and that fi^^^Nk -H- U, where U is some nonnegative ran- 
dom variable. Furthermore, suppose that the Xj are identically distributed. Let us 
consider the characteristic function Xzw of ^he random variable Z^'"^ := '^f=i 
For t G M we obtain 



(8) = E[e-''^k/^] = E[e-—- 



Since the mapping [0, oo) 3 x ^ e a'^GMis bounded and continuous for each 
t G M we conclude that 



(9) XziMt)'^E[e~'-^^]=:m- 

Note that = 1 and that is continuous at by the dominated convergence 
theorem and hence, is a characteristic function of some random variable Y. It is easy 

to see by characteristic functions that Y = ay/U(, where ( ~ N{0, 1) is independent 
of U. Thus, we can conclude that 

(10) Z^''^ 4 r := aVuC as A; ->cx) . 

If we also let := J2f=i and if we assume that the bounds from Corol- 

lary [231 (with Z replaced by Z^'^^ and W by W^''^) tend to zero as k goes to infinity, 
we can conclude that also 



(11) ^Y = aVuC ask^oo. 

The distribution C{Y) is a scale mixture of the standard normal distribution and 
the result (fTT]) is in accordance with the theory from Chapter 3 of |GK96| . see for 
instance Theorem 3.3.2. 

Now, the next task is to derive concrete error bounds on the distance 
c?-^ (£(Z), £(y)) from bounds on the distance (i-^ (£(/i~^A^), £([/)) . This appears to 
be possible, since in the case of identically distributed Xj we have with ( independent 

of that Z = (Ta/ fi~^N(, as is easily checked. We will provide two propositions for 
the iid case, the first bounding the Kolmogorov distance of C{Z) and C{Y) in terms 
of dfc{C{fi~^ N) , C{U)) and the second giving bounds on the Wasserstein distance of 
C{Z) and A^(0, a^) in the case that ?7 = 1 is constant. Then we will turn to the case 
of not necessarily identically distributed summands Xj. 
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Proposition 2.5. Let ( ~ iV(0, 1) be independent of N and ofU. Then, for every 
a > 

Proof. For z G M we have 



\P{ctVuC <z)- P{a^/i2-^NC < z)\ 

= E[P{(rVUC < z\C) - P{a^/JF^C < 



(12) 



< E 



p{aVuc < z\C) - P{a^JF^C < AO 



Now, for each s G M by the independence of Q and U and of and 



P{aVuC < z\C = s) - P{ay/fi-^NC < z\C = s] 



s>0 
s = 
s < 



(13) 



<d4ciVu),ci^/J^)) 



since the Kolmogorov distance is also induced by the test functions l[t,oo); t G M. 
Now, recall that the random variables fi~^N and U are nonnegative and, hence, for 
each t > we have 



\PiVu <t)- P{^/IF^ <t)\ = \Piu < t^) - P{ij,-^N < t^)\ 

<d,c{C{U),C{f^-'N)), 
and taking the supremum over t > yields 



(14) d4CiVu),Ci^/F^)) <dic{CiU),Cifi-'N)) 

Thus, from (fT2|) . (fT3|) and (fT4|) we obtain for every 2; G M 



\P{aVUC < z) 



proving the claim. 



P{a^JF^C <z)\< E[d^{C{U),C{^-'N))] 
= d4C{U),C{fi-'N)), 



□ 
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Proposition 2.6. //C ~ N{0, 1) is independent of N, then for every a > 



Remark 2.7. Of course, the conclusion of Proposition 12.61 can only yield useful 
bounds, if normal approximation of C{W) makes sense. The upper bound shows 
that Var(A^) should be of smaller order than /x^ = {E[N]y in order for normal 
approximation to be plausible. 

Proposition 12.61 immediately follows from the following two lemmas. 

Lemma 2.8. IfU>0 and ( ~ ^(0, 1) is independent ofU and of N, then 





-d^{ciVu),c{^/iF^)). 



Proof. Let /i : M — )■ M be 1-Lipschitz. Then, 



E[hiaVUC)]-E[h{a^/JI^C)]\ 



= E[E[hiaVuC)\C]- E[hia^/JF^C)\C]\ 



< E E[hiaVuC)\C] - E[hia,/JF^Cm 



(15) 




Now, for every s G M by independence we have 



E[h{aVU(;) - h{ay/fi~^NC) \C = s] 



= E[h{asVu) - h{as^/^x-^N)] 



(16) 



< \s\adw{CiVu), C{^/^l-^N)) . 



Hence, from ( ITSl) 



and ( IT6l) we obtain that 



E[hiaVUC)]-E[hiay^JF^C)] 



<crE[\(\]d^{^VU),^^/F^)) 




as claimed. 
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□ 



Lemma 2.9. Suppose that U = 1 is constant. Then, 



/i 

Proof. For a nonnegative real number x we have the inequahty 



(17) |l-a;| = ^ —<\l-x'^\ 



1 + x 



From (fT7|) with x = \/ yr^N and for each 1-Lipschitz function h we obtain 



< E\l - V/U-iA^ 



<E\l- fi-^N\ < v/Var(/i-iiV) 
_ v/Var(iV) 



□ 



Now, we turn to the general case that the Xj are no more supposed to be identically 
distributed. Here, we will assume that 

1 " 

(19) < := lim - Vor^ < oo 

i=i 

exists. Note that in the iid case = = - Y^" , cr? for each n > 1. We further 
define 



(20) a^=^a|. 
Note that we have the general representation 

(21) Z = ^/JF^anC ■ 
The following lemma will be useful. 

Lemma 2.10. Leta,T e (0, oo). Then, 

(a) rfyc(iV(0,cr2),iV(0,r2)) <min(|l-^|,|l-^|) and 
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(b) rfw(iV(0, iV(0, r2)) < 2 min(a|l - r|l - ^|) . 

Proof. The proof uses Stein's method for univariate normal approximation. Let 
( ~ A^(0, 1). We first prove (a). Let t e M be arbitrary. Then, 

|PK < t) - P(rC < t)\ = |P(C < -) - P(-C < -)| 

(22) <ci^(iV(0,l),iV(0,^)). 
Hence, by symmetry, 

(23) 



dK (iV(0, a^), iV(0, r^)) < max rf^ (iV(0, 1), iV(0, — )) , rf^ (iV(0, 1), iV(0, - 



Now, for z G M let be the standard solution to Stein's equation 

(24) /'(a;) - x/(x) = l(_oo,.] - P(C < ^) , 

which is given by fz{x) = ^where (p and $ denote the density function 

and the cumulative distribution function of A^(0, 1), respectively. It is known that 
fz is Lipschitz with ||/^||oo ^ 1 for each z G M and that for each c > and every 
Lipschitz function / it holds that E[c(f{c()] = c^i?[/'(cC)]. Hence, by inserting c( 
into and taking expectations 

im<^)-^(c<^)| = \E[f:{co-cCMcO] 

= \l-cWM)]\<\l-c'\, 

yielding 

ci^(iV(0,l),iV(0,c2))) < \l-c^\. 
By (123|) this implies (a). 

The proof of (b) is similar by using the solution to Stein's equation for a 1-Lipschitz 
test function h. One merely has to take care of the scaling properties of Wasserstein 
distance in the equality corresponding to (122|) and note that in this case ||/^||oo < 2 
for the standard solution fh of Stein's equation (see, e.g. |CGS11| ). 

□ 
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The strategy to keep track of the non-identically distributed case is to use the 
inequahty 

(25) dn[C{Z),C{Y))<dn[C{Z),C{Z))+du{C{Z),C{Y)), 



where Z := a a/ fi^^N( and Y := aVU(. The first term on the right hand side 
of f l2^ may be bounded with the help of Lemma 12.101 (see Lemma I2.1ip and the 
second term may be handled by Proposition 12.51 in the case of non-constant U and 
by Proposition 12. 6[ if f/ = 1 is constant and normal approximation is plausible. We 
will need one more lemma, comparing C{Z) and C{Z). 

Lemma 2.11. With the above definitions we have: 

(a) d^{CiZ),C{Z)) < EZiPiN = n)min(|l - |^|, |l - |||) . 

(b) dn>{CiZ), CiZ)) < 2/i-i/2 V^PiN = n) min(a„| 1 - ||, a| 1 - |||) . 
Proof. Let H be either equal to /C or to W. Then, recalling representation fl2T]) . for 

hen 

\^E[h{Z)] -E[h{Z 

oo 

< Y,P{N = n)\E[h{^l-^'^^a,,C)\ - E[h{^-'/^^aC)\ 

n=l 

oo 

(26) <Y^P{N = n)dn (iV(0, fi-'nal), N{0, fi-'na'] 

n=l 

If "H = /C, then the bound in (l26l) equals 



J2 PiN = n)dfc (iV(0, 6-2), iV(0, a')) <Y,P{N = n) min(| 1 - ^ | , 1 1 - | 

n=l n=l 

by Lemma [2.101 (a). If "H = W, then the bound in (l26l) equals 



/i-V2 ^ ^p(iv = n)dy, (iV(0, a^), iV(0, a 

n=l 

oo „ 2 

< 2/i-^/2 ^ ^P{N = n) min 1 1 - I , a 1 1 



n=l 

by Lemma [2.101 (b). This completes the proof. 



at' ' 0-2 
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□ 



Putting the pieces together, we have the following proposition. 
Proposition 2.12. Under the general assumptions and if (fT^ is satisfied, we have 

(a) d4C{Z),CiY)) < EZim = n)min(|l - |l 
+ diciC{fx-^N),C{U)) and 

(b) dw{C{Z),N{0,a^)) <2fi-'/^ZZiV^PiN = n)mm(dn\l-^ 



a 



VVar(jV) 



+ 



where o"„ was defined in ( 120|) . 



Proof. The claims follow from Propositions 12.51 and 12.61 with a replaced by a to deal 
with the respective second term on the right hand side of (125|) and from Lemma [2. Ill 
to handle the first term. 

□ 



3. Applications 

3.1. Geometric Sums. In this subsection, N = Np will always have the geometric 



distribution on {1, 2, ... } with parameter p G (0, 1), i.e. P(N - 
for each integer n > 1. Note that this implies fi := fip := E[Np] 



n) = p{l — p)" ^ 
= p~^. In order to 



derive moore concrete rates of convergence for geometric sums from Corollary 12. 4[ 
we will first prove the following lemma. 

Lemma 3.1. Let Np have the geometric distribution with parameter p G (0, 1). Then, 



Vp<e 



< 



1 + 



Proof. We will first prove the left-most inequality. Let if : (0, oo) — (0, oo) be defined 



by (fix) 



Then, ip is convex and hence, by Jensen's inequality, we have 



^=^{E\Np\)<E{^[Np)\=E 



1 



Now, note that the right-most inequality is trivial. For the remaining inequality 
we write 



(27) 



E 



p 



1 — p 
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where f{p) := J2T=i -jti^ — p)'' for < p < 1. Note that / is continuous and has 



an analytic extension to (0,2). We have 

f{p) = -J2Vkil- pf-' = —E . 



k=l 



p 



Thus, by the fundamental theorem of calculus, we have 



(2^ 



f{p) = /(I) + J ns)ds 



-ds . 



Since a/x is concave on (0, oo), it follows again by Jensen's inequality that 

1 



E[^/Ns] <Vm7] = —- 
Thus, form (128|) we conclude that 



(29) f{p) < / s-''/^ds = -2s-^'^ 

Jp 

Hence, from (1271) and ( l29l) we obtain 
1 



E 



<^2(^-i)=2Vp--^- 



1-p \^ 



1-p l + ^/p 



proving the inequality. 



□ 



Remark 3.2. Since the function y/x is concave, on could try to make use of the 
inequality 



E 



\ ^ 1 










= E 















for the upper bound. But as one can show that E[N 



P-'] 



application of Jensen's inequality would only yield the worse bound 



-p log p 
1 — p ' 



this direct 



E 



logp\ 1/2 



1 — P 
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From Corollary 12.41 and Lemma [3. II we obtain the following result. 

Proposition 3.3. Let, additionally to the general assumptions, the random vari- 
ables Xj be identically distributed with := -EfXi] and ^ := E\Xi\^, then 

d^{C{W),C{Z))<^^. 

It can easily be checked by means of characteristic functions that pN^ — )■ f/ ~ 
Exp(l) as p J, 0. From Example 3.1 of |PR11] we even have the following quantitative 
version of this result: 



(30) c?^(£(piVp),Exp(l)) < I2p 

From (fTOj) we see that we should approximate C{W) by the distribution of the 
random variable Y = a\/U(, where ( ~ -^(0, 1) is independent of U. With the help 
of chatacteristic functions one can easily show that Y has the Laplace distribution 
Laplace(0, cr/y^), where for a G M and b > the distribution Laplace(a, b) is defined 
by the density fa^ : M — )■ M with 

Our observations now lead at once to the following theorem. 

Theorem 3.4 (Berry-Esseen bound for geometric sums of iid centered random vari- 
ables). Let Xi,X2,... be independent and identically distributed random variables 
with E[Xi] = 0, < CT^ := E[Xf] and ^ := E\Xi\^ < oo and let N = Np be inde- 
pendent of the Xj and have the geometric distribution on {1, 2, ... } with parameter 
p e (0,1). Then, 

rfyc(/:(iy),Laplace(0,^)) < ^^+12p. 

Proof. This follows from Proposition 12. 5^ Proposition 13.31 and fl30p by the triangle 
inequality for die- 

□ 

Now, we turn to not necessarily identically distributed summands Xj. Let in 
accordance with (T„ the quantities ^„ be defined by 



1 " 

(31) in:=-J2^,. 

i=i 

Then, we can prove the following theorem. 
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Theorem 3.5 (Berry-Esseen bound for geometric sums of independent, centered 
random variables). Let Xi, X2, ... be independent random variables with E[Xj] = 0, 
< (j| := and := < 00 for j > 1 and let N = Np be independent of 

the Xj and have the geometric distribution on {1, 2, ... } with parameter p G (0, 1). 
Assume that (fT9|) is satisfied. Then, with defined by (120|) and with ^„ defined by 
( 1ST]) it holds that 



^ 00 ^2^2 

dye Laplace (0, ^)) < 12p + p ^^(1 - pf-^ min(| 1 - |l - ^ 



n=l 

1 iN 



N(T% 



+ CicE 

Proof. Note that since s% = fi^^Nafj, by Proposition 12.31 (b) 

^1 



Furthermore, by Proposition 12.121 (a) and (130|) 



^00 ^2 - 2 

rfyc(/:(^),Laplace(0, ^) < - p)'^"^ min(|l - |l - ^|] + 12p . 

Hence, the claim follows from the triangle inequality. 

□ 



Remark 3.6. A different way to prove Berry-Esseen bounds in the non-identically 
distributed setting is to make use of the bound 



/ 1 \ 12p 

(32) £(s^),Exp(— ^) < — ^sup(t|, 

V E[a^\ J E[a^\ j>i 

which follows from Theorem 3.1 in |PR11] . noting that = -Efo"^] in case of the 

geometric distribution. This, in turn, follows from ([3]) because P{N > j) = {l—p)^~^ 
and = P- Then, by a suitable Proposition, which is completely analogous to 
Proposition I2.5[ one can show that since Z = s^C, 

(33) dyc(/:(^),Laplace(0,(E[a^])i/Vv^)) < -l?|-supa|. 

Thus, we have the following alternative bound in the situation of Theorem 13.51 
(without assuming the existence of o"^): 
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(34) d^(£(iy),Laplace(0,(E[a^])i/Vv^)) <C^i?[-=^j +-Psnpa] 



Note that both, the bound from Theorem 13.51 and reduce to the bound from 
Theorem 13.41 in the iid case. 

3.2. The Poisson case. In this subsection, we discuss the case that = A^^ ~ 
Poisson(A) with A > 0. Note that this imphes E[N] = Var(A^) = A and, hence, by 
Remark 12 . 71 normal approximation of the random sum indexed by N is appropriate, 
as A — )■ oo. Again, we begin with the identically distributed case. 

Theorem 3.7. Let Xi,X2,... be independent and identically distributed random 
variables with E[Xi] = 0, < := E[Xf] and ^ := E\Xi\^ < oo and let N ~ 
Poisson(A), A > 0, be independent of the Xj. Then, 



M£(W'),iV(0,.^))<(^ + .yf)-i=. 
Proof. From Corollary 12.41 (a) we have 

d^{C{W),C{Z))<^^ 
and from Proposition 12.61 it follows that 

d^{C{Z),N{0,a')) <a^^. 
Hence, the assertion follows from the triangle inequality for d\^. 

□ 

The next result generalizes Theorem 13.71 to non-identically distributed summands. 

Theorem 3.8. Let Xi,X2,... be independent random variables with E[Xj] = 0, 
< crj := and := E\Xjf < oo, j > 1, and let N ~ Poisson(A), A > 0, 6e 

independent of the Xj. Assume that f|T9|) is satisfied. Then, with defined by (120|) 
and with ^„ defined by f l3ip it holds that 



d^{C{W),N{0,a')) <a^^ + ^E 



6v 
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Proof. From Proposition 12.31 (a) it follows that 

d„{c{wic{z)) < ^e[^E&] = • 

since = X'^aj^. From Proposition 12. 121 (b) it follows by some minor calculations 
that 



Again, the claim follows from the triangle inequality for dy\;. 

□ 

3.3. The binomial case. In this subsection we let = Nm ~ Binomial(m,p) 
where m > 1 is an integer, which is supposed to be large, and p G (0, 1) is considered 
fixed. Note that this implies E[N] = mp and Var(A^) = mp(l —p). Thus, by Remark 
12.71 normal approximation of the random sum indexed by N is plausible, as m ^ oo. 
As usual, we first state our theorem for identically distributed summands. 

Theorem 3.9. Let Xi,X2,... be independent and identically distributed random 
variables with E[Xi] = 0, < a"^ := E[Xf] and ^ := E\Xi\^ < oo and let N ~ 
Binomial(m,p) be independent of the Xj. Then, 

d^{ciw),Nioy)) < + J^^)-L. 

V (T^ V TT / y^mp 

Proof. The proof, being completely analogous to that of Theorem 13. 7[ is omitted. 

□ 

Remark 3.10. (i) The bound in Theorem 13.91 shows that normal approximation is 
appropriate even if p goes to zero as long as mp goes to infinity. This includes 
for instance the cases p = pm = cm'"" for some constant c and < a < 1. 
(ii) Note that for p = 1 the bound from Theorem 13 . 91 equals the Berry-Esseen bound 
in the Wasserstein distance for a sum of m random variables. This makes sense 
since a random variable A^ ~ Binomial (m, 1) equals m almost surely. 

The next theorem generalizes the last result to the case of not necessarily identi- 
cally distributed summands. 

Theorem 3.11. Let Xi,X2, ... be independent random variables with E[Xj\ = 0, 
< (t| := E[Xj] and := E\Xj\^ < oo, j > 1, and let N ~ Binomial(m,p) be 
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independent of the Xj. Assume that (fT9|) is satisfied. Then, with defined by (120|) 
and with ^„ defined by (13T]) it holds that 



d^{C{W),N{0,a')) < (a 



2 

+ 















p)™-"mm(^a„,|l - 






/m» ^ — ^ 

V n=l 

Proof. The proof, being completely analogous to that of Theorem 13.81 is omitted. 

□ 
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